(54) Voice service system and method



(57) A voice service system is provided with an
interactive voice response unit (11) for interactively
dealing with a call from a human caller (2), an operator
subsystem (12) by which a human operator (25) can
verbally interact with the caller, and a routing manager
(10) for transferring handling of the call between the
voice response unit (11) and the operator subsystem (12).
The operator's verbal interaction with the caller is done
through a synthesised voice in order to mask from the
caller that they are now talking to a human operator. This
permits the operator (25) to return handling of a call back
to the voice response unit (11) without offending the
caller.

Description

Field of the Invention 

[0001] The present invention relates to a method and 
system for providing voice services. 

Background of the Invention 

[0002] Automated interactive voice services use voice
synthesis technology to converse with a caller in a
dialogue. Callers become used to this synthesised voice
in the dialogue. However, it is not possible to completely
understand human speech or anticipate users' requests
and responses in automated systems. Therefore, to assist
in problem situations, automated voice response systems
generally provide a path to a human assistant (often
referred to as an operator or agent). Once this person is
contacted, the change in voice cues the caller that a real
person has taken over and can resolve their problem.
Typically, a brief period of human interaction can resolve
the problem that the caller was experiencing and
automatic response could be continued. However, passing
a caller back to an automatic system is considered
somewhat rude and the caller enquiry is normally
completed by the human operator.
[0003] It is an object of the present invention to provide
a method and apparatus facilitating the more efficient
use of operators in a voice response service.

Summary of the Invention 

[0004] According to one aspect of the present inven- 
tion, there is provided a voice service system compris- 
ing: 

- an interactive voice response unit for interactively
dealing with a call from a human caller,
- an operator subsystem by which a human operator
can verbally interact with the caller, and
- a transfer arrangement for transferring handling of
the call at least in one direction between the
operator subsystem and the voice response unit;

the operator subsystem including a masking arrangement
for causing the operator's verbal interaction with the
caller to be done through a synthesised voice whereby to
mask from the caller that they are talking to a human
operator, the transfer arrangement being usable by the
operator to have handling of a call transferred to the
voice response unit.

[0005] According to another aspect of the present
invention, there is provided a voice service system
comprising an interactive voice response unit for
interactively dealing with a call from a human caller, an
operator subsystem by which a human operator can
verbally interact with the caller, and transfer means for
transferring handling of the call between the voice
response unit and the operator subsystem; the voice
service system having masking means for causing the
operator's verbal interaction with the caller to be done
through a synthesised voice whereby to mask from the
caller that they are now talking to a human operator, the
transfer means being usable by the operator to have
handling of a call transferred to the voice response unit.
[0006] According to a further aspect of the present
invention, there is provided a method of providing voice
services in respect of a call placed by a human caller,
the method comprising the steps of:

(a) carrying out a verbal interaction between the
caller and a human operator;
(b) at the instigation of the operator, transferring the
call to an interactive voice response unit; and
(c) continuing verbal interaction with the caller
through the voice response unit;

the operator's verbal interaction with the caller in step
(a) being done through a synthesised voice whereby to
mask from the caller that they are talking to a human
operator.

[0007] According to a still further aspect of the present
invention, there is provided a method of providing voice
services in respect of a call placed by a human caller,
the method comprising the steps of:

(a) enabling voice interaction between the caller
and a voice response unit;

(b) analysing the caller's interaction with the voice
response unit to determine whether the caller
requires operator assistance;

(c) where this analysis indicates that operator
assistance is required, transferring the call to a human
operator; and

(d) carrying out a verbal interaction between the
caller and a human operator, this interaction being
done through a synthesised voice whereby to mask
from the caller that they are talking to a human
operator.

Brief Description of the Drawings 

[0008] A voice service system and method, both
embodying the invention, will now be described, by way of
non-limiting example, with reference to the accompanying
diagrammatic drawings, in which:

• Figure 1 is a diagram of a prior art voice service
system; and

• Figure 2 is a diagram of a voice service system
embodying the invention.

Best Mode of Carrying Out the Invention

[0009] Figure 1 shows a prior art voice service system
8 which a caller 2 can contact using equipment 3 to place
a voice call over a sound connection 7 (that is, a channel
for signals representing sound) established through
communications infrastructure 6. The equipment 3
includes microphone 4 and loudspeaker 5 and is, for
example, a telephone. The communications infrastructure
is typically a telephone network (such as a PSTN or
PLMN) but may also or alternatively comprise the
internet. The sound channel 7 can, for example, be a
circuit-switched voice circuit or a VoIP connection across
a packet network.

[0010] The voice service system 8 comprises an
automated interactive voice response (IVR) unit 11, an
operator subsystem 12, and a routing manager 10 for
transferring handling of a call between the IVR unit 11
and the operator subsystem 12.
[0011] The IVR unit comprises a plurality of channels,
each for handling a corresponding call and each
substantially of the form shown in Figure 1. More
particularly, on its input side each IVR channel comprises
a speech recogniser 14 and a DTMF (Dual Tone
Multi-Frequency) decoder 15, both of which feed an
interaction manager 13 which decides what action to take
on the basis of the received inputs and a current
interaction script being followed. The current script is one
loaded from a store 18 in dependence, for example, on the
initial contact information associated with the caller (such
as number dialled where the caller is communicating over
a telephone network, different numbers corresponding
to different desired services); the current script may
alternatively depend on an option selected by the user
from a preceding script, option selection being done
either by recognition of a spoken choice or by having the
user press an appropriate key to generate a DTMF code
recognised by the DTMF decoder. Basic IVR units may
not include a speech recogniser 14 and simply depend
on the DTMF decoder 15 for collecting caller input;
conversely, where a speech recogniser is provided, the
DTMF decoder can be omitted.
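
By way of illustration only, the script selection performed by the interaction manager 13 might be sketched as follows. This is a minimal sketch under stated assumptions: the names `Script`, `SCRIPT_STORE`, `DIALLED_NUMBER_TO_SCRIPT` and `InteractionManager`, the dialled number and the prompts are all hypothetical and do not come from the specification.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional


@dataclass
class Script:
    """One interaction script; 'options' maps a caller choice to the next script name."""
    name: str
    prompt: str
    options: Dict[str, str] = field(default_factory=dict)


# Hypothetical contents of the script store (store 18 in the text).
SCRIPT_STORE = {
    "main": Script("main", "Say 'balance' or press 1.",
                   {"1": "balance", "balance": "balance"}),
    "balance": Script("balance", "Your balance will be read out."),
}

# Assumption: the dialled number selects the opening script.
DIALLED_NUMBER_TO_SCRIPT = {"0800100": "main"}


class InteractionManager:
    def __init__(self, dialled_number: str) -> None:
        self.script = SCRIPT_STORE[DIALLED_NUMBER_TO_SCRIPT[dialled_number]]

    def handle_input(self, dtmf: Optional[str] = None,
                     speech: Optional[str] = None) -> str:
        """Accept either a decoded DTMF key or recognised speech and pick the next script."""
        choice = dtmf if dtmf is not None else (speech or "").strip().lower()
        next_name = self.script.options.get(choice)
        if next_name is None:
            # Unrecognised choice: stay on the current script and repeat its prompt.
            return "Sorry, I did not understand. " + self.script.prompt
        self.script = SCRIPT_STORE[next_name]
        return self.script.prompt
```

Either input path can be left unused, which mirrors the remark that the speech recogniser or the DTMF decoder may be omitted.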

[0012] On its output side, each IVR channel comprises
a text-to-speech converter 16 for turning output text,
provided by the interaction manager 13 from the current
script, into synthesised voice output, and an audio
server 17 for outputting richer sound material as
commanded by the interaction manager. Again, one or
other of the units 16, 17 can be omitted.
[0013] The operator subsystem comprises, in the
present case, a plurality of operator workstations 26 with
telephony capability, each workstation 26 being
connectable to receive a call routed to it by routing
manager 10 via a respective connection 21. An operator
25, with headset unit 27, can hear and talk to a caller
routed to their workstation 26 and can use the latter to pull
up scripts from a store 28 and display them on the screen
of the workstation 26 for use in conversing with the
caller.

[0014] In operation of the voice service system, a caller
would normally be routed by routing manager 10 to
an available channel of the IVR unit 11 and an appropriate
script would then be run by the interaction manager
13 for that channel (how scripts are selected has already
been described above).

[0015] The caller will normally be informed, as part of
the opening of a script, that they can ask for operator
assistance at any time by pressing a particular key. If the
caller does this, the key press is detected by DTMF
decoder 15, causing the interaction manager 13 to inform
the routing manager 10. The routing manager then
proceeds to find a free operator 25 and routes the call to
the workstation 26 associated with that operator.
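
By way of illustration only, the key-press path of the prior art system can be sketched as below. This is a sketch under stated assumptions: the key `"0"`, the workstation identifiers and the names `RoutingManager` and `on_dtmf` are hypothetical, and the text does not say what happens when no operator is free, so the sketch simply returns `None` in that case.

```python
from typing import Dict, List, Optional

# Assumption: the text says only "a particular key"; "0" is an illustrative choice.
OPERATOR_KEY = "0"


class RoutingManager:
    def __init__(self, workstations: List[str]) -> None:
        # Workstation id -> call currently routed to it, or None if its operator is free.
        self.assignments: Dict[str, Optional[str]] = {ws: None for ws in workstations}

    def route_to_operator(self, call_id: str) -> Optional[str]:
        """Route the call to the first free workstation and return its id."""
        for workstation, current_call in self.assignments.items():
            if current_call is None:
                self.assignments[workstation] = call_id
                return workstation
        return None  # No free operator; any queueing policy is outside the text.


def on_dtmf(key: str, call_id: str, routing: RoutingManager) -> Optional[str]:
    """Called with each decoded key; only the operator key triggers routing."""
    if key == OPERATOR_KEY:
        return routing.route_to_operator(call_id)
    return None
```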
[0016] The operator converses with the caller to solve
their problem and generally handles the call to comple- 
tion. 

[0017] Figure 2 shows an implementation of the
present invention as applied to the voice service system
of Figure 1. More particularly, in the Figure 2
arrangement, upon a call being transferred to an operator
25, the operator's output to the caller is provided or
converted into text message form and passed to a
text-to-speech converter so as to be presented to the caller
in a synthesised voice. As a result, the caller is masked
from the fact that they are talking to a human operator,
which makes it socially easier for the operator to pass
handling of the call back to the IVR unit.
[0018] In the Figure 2 embodiment, the switching of
call handling to an operator is triggered by an analysis
unit 35 that analyses the user's inputs to determine
when a caller appears to be having difficulty - this may
be indicated by an illogical sequence of choices, by
detecting words of frustration, or by similar indicators. The
analysis unit 35 informs a mode control unit 36 of the
interaction manager which puts the latter in a passive
mode where it ceases to be in control of handling the
call; however, the IVR channel concerned remains
associated with the call. The mode control unit also informs
the routing manager 10 that the call should also be
routed to a human operator, which the routing manager then
does, placing the operator subsystem in charge of
handling the call.
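
By way of illustration only, the cooperation of the analysis unit 35 and the mode control unit 36 can be sketched as follows. This is a sketch under stated assumptions: the word list, the threshold and the class names are hypothetical, and counting invalid choices is used here merely as a simple stand-in for detecting "an illogical sequence of choices".

```python
from typing import Callable

# Illustrative indicators only; the text does not list specific words or thresholds.
FRUSTRATION_WORDS = {"useless", "stupid", "ridiculous"}
MAX_INVALID_CHOICES = 2


class ModeControl:
    """Puts the interaction manager into passive mode and informs the routing manager."""

    def __init__(self, notify_routing: Callable[[str], None]) -> None:
        self.mode = "active"
        self._notify_routing = notify_routing

    def request_operator(self, call_id: str) -> None:
        if self.mode == "active":  # Transfer only once per difficulty.
            self.mode = "passive"
            self._notify_routing(call_id)


class AnalysisUnit:
    def __init__(self, mode_control: ModeControl, call_id: str) -> None:
        self.mode_control = mode_control
        self.call_id = call_id
        self.invalid_choices = 0

    def observe(self, utterance: str, was_valid_choice: bool) -> None:
        """Inspect one caller input and trigger a transfer if difficulty is indicated."""
        if not was_valid_choice:
            self.invalid_choices += 1
        words = set(utterance.lower().split())
        frustrated = bool(words & FRUSTRATION_WORDS)
        if frustrated or self.invalid_choices >= MAX_INVALID_CHOICES:
            self.mode_control.request_operator(self.call_id)
```

Note that the IVR channel object is not released here; only its mode changes, consistent with the channel remaining associated with the call.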

[0019] The mode control unit 36 and analysis unit 35,
together with the routing manager 10, form a transfer
arrangement for transferring the call between the IVR
channel and the operator subsystem; as will be seen
below, in the Figure 2 embodiment, this transfer can be
in either direction, with transfer from the operator
subsystem to the IVR channel being effected under
operator control.

[0020] It may be noted that the transfer of call-handling
control to the operator subsystem is preferably not
done by specific caller selection (e.g. by an appropriate
key press) as this creates an expectation of a human
voice, not a synthesised one; however, transfer can be
initiated by specific caller selection in appropriate cases.
[0021] Once call handling has been transferred to the
operator subsystem, the operator can proceed in several
ways to produce text messages for output to the caller,
including:

by typing in messages using the keyboard of
workstation 26;

by selecting scripts called up onto the workstation
display from store 28;

by voice input to speech recogniser 30 that produces
a text output, this latter preferably being fed back
to the workstation for display and editing as required
by the operator.

The text messages are passed from workstation 26 over
connection 31 (for example, a LAN) to the text-to-speech
converter 16 of the IVR channel still associated
with the call. In this manner, the output from the operator
is presented to the caller in the identical synthesised
voice as used when the IVR channel was in charge of
handling the call.
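
By way of illustration only, the three ways of producing operator text and their common path through the text-to-speech converter 16 can be sketched as below. This is a sketch under stated assumptions: all class and method names are hypothetical, and `TextToSpeech` merely records what would be spoken rather than synthesising audio.

```python
from typing import Dict, List, Optional, Tuple


class TextToSpeech:
    """Stand-in for converter 16: records (voice, text) pairs instead of producing audio."""

    def __init__(self, voice: str) -> None:
        self.voice = voice
        self.spoken: List[Tuple[str, str]] = []

    def speak(self, text: str) -> None:
        self.spoken.append((self.voice, text))


class IvrChannel:
    def __init__(self, tts: TextToSpeech) -> None:
        self.tts = tts

    def say(self, script_text: str) -> None:
        self.tts.speak(script_text)


class OperatorWorkstation:
    """Sends operator text to the converter of the IVR channel still tied to the call."""

    def __init__(self, channel_tts: TextToSpeech, scripts: Dict[str, str]) -> None:
        self.tts = channel_tts
        self.scripts = scripts  # Hypothetical contents of store 28.

    def send_typed(self, text: str) -> None:
        self.tts.speak(text)

    def send_script(self, key: str) -> None:
        self.tts.speak(self.scripts[key])

    def send_recognised(self, recognised_text: str, edit: Optional[str] = None) -> None:
        # recognised_text is assumed to come from speech recogniser 30;
        # 'edit' is the operator's optional correction made before output.
        self.tts.speak(edit if edit is not None else recognised_text)
```

Because the workstation and the IVR channel share one converter instance, every utterance carries the same voice, which is the property the masking relies on.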

[0022] Because the resources of the IVR channel are
still associated with the call, the operator can also call
upon the audio server to play any material considered
appropriate.

[0023] When the operator has solved the caller's current
difficulty, control of handling the call is passed back
to the interaction manager 13 by the operator using
workstation 26 to signal to the mode control unit 36, this
latter then being responsible for changing the operating
mode of the interaction manager back into an active state,
and for informing the routing manager 10 that the call
can be taken back from the operator subsystem, thereby
freeing up the operator for another call. Of course,
where the operator deals fully with the caller and their
reason for calling, the operator simply terminates the
call rather than passing it back to the IVR channel.
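
By way of illustration only, the two ways in which an operator can finish with a call (handing it back or terminating it) can be sketched as follows. This is a sketch under stated assumptions: `CallSession`, `HandbackControl` and the operator identifiers are hypothetical names, and the set of busy operators stands in for whatever bookkeeping the routing manager 10 actually performs.

```python
from typing import Set


class CallSession:
    """State of one call while the operator subsystem is in charge of it."""

    def __init__(self, call_id: str) -> None:
        self.call_id = call_id
        self.ivr_mode = "passive"   # Interaction manager is not in control.
        self.handler = "operator"
        self.ended = False


class HandbackControl:
    def __init__(self, busy_operators: Set[str]) -> None:
        self.busy_operators = busy_operators

    def hand_back(self, session: CallSession, operator_id: str) -> None:
        """Operator signals that the difficulty is solved: reactivate the IVR side."""
        session.ivr_mode = "active"
        session.handler = "ivr"
        self.busy_operators.discard(operator_id)  # Operator is free for another call.

    def terminate(self, session: CallSession, operator_id: str) -> None:
        """Operator has dealt fully with the caller: end the call without handback."""
        session.ended = True
        self.busy_operators.discard(operator_id)
```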
[0024] The advantages of the Figure 2 arrangement 
include that the user experience is preserved and that 
operator intervention is limited to places where help is 
really required. 

[0025] Many variants are, of course, possible to the 
Figure 2 arrangement. For example, the IVR channel 
could be freed up when a call is transferred to the oper- 
ator subsystem in which case a separate text-to-speech 
converter would be required to produce a synthesised 
voice output for passing to the caller. 
[0026] Furthermore, determining when an operator
should intervene could be done by "eavesdropping" on 
the exchanges between callers and the IVR channels; 
this eavesdropping can probably best be done by view- 
ing a transcript of the exchanges rather than by directly 
listening in since a transcript gives a history of the pro- 
gression of an exchange. The eavesdropping approach 
can be implemented additionally or alternatively to the 
automatic analysis effected by unit 35. 
[0027] The operator's proposed text message re- 
sponses can be automatically analysed and corrected 
for grammar and complexity and can even be changed 
to a style more like that used in the scripts run by the 
interaction manager 13. 

[0028] An incoming call can be passed directly to the
operator subsystem without the user having first been
through an IVR interaction and the analysis unit 35
having detected a problem requiring operator intervention;
for example, the DTMF decoder 15 may detect an input
indicating an issue needing direct operator input. In such
cases, it is still advantageous for the operator's input to
be masked by being provided through the text-to-speech
converter 16 for the reasons already noted,
namely to disguise the transfer of the user to an IVR
interaction. Indeed, all calls could initially be routed to an
operator, the operator then deciding on the basis of their
initial interaction with each caller whether the caller can
be transferred to a standard IVR script. Thus the transfer
arrangement (which in Figure 2 comprised the mode
control 36, the analysis unit 35, and the routing manager
10) may in certain cases simply provide for transferring
a call from the operator subsystem to the IVR channel
without providing for the reverse transfer (in which case,
the transfer arrangement may simply comprise the
routing manager with operator input to control transfer to an
IVR channel).



Claims 

1. A voice service system comprising:

an interactive voice response unit for interactively
dealing with a call from a human caller,
an operator subsystem by which a human operator
can verbally interact with the caller, and
a transfer arrangement for transferring handling
of the call at least in one direction between
the operator subsystem and the voice response
unit;

the operator subsystem including a masking
arrangement for causing the operator's verbal
interaction with the caller to be done through a
synthesised voice whereby to mask from the caller that
they are talking to a human operator, the transfer
arrangement being usable by the operator to have
handling of a call transferred to the voice response
unit.

2. A voice service system according to claim 1,
wherein the masking arrangement comprises text
response means for generating text messages from
the operator, and means for passing these messages
to a text-to-speech converter for output to the
caller.

3. A voice service system according to claim 2,
wherein the text-to-speech converter is part of the voice
response unit and provides the same synthesised
voice to the caller whether the call is being handled
by the operator subsystem or by the voice response
unit.

4. A voice service system according to claim 2,
wherein the text response means comprises a keyboard
for operator entry of text messages.

5. A voice service system according to claim 2,
wherein the text response means comprises a speech
recogniser for receiving voice input from the
operator and generating text messages.

6. A voice service system according to claim 5, where- 
in the text messages output by the speech recog- 
niser are passed to an editing console of the oper- 
ator subsystem to enable the operator to check and 
edit the messages prior to output to the text-to- 
speech converter. 

7. A voice service system according to any one of the
preceding claims, wherein the transfer arrangement
includes an analysis subsystem for analysing
the caller's inputs when the voice response unit is
handling the call whereby to determine whether the
caller requires operator assistance; the analysis
subsystem being operative, upon determining that
the caller requires operator assistance, to cause the
transfer arrangement to transfer the call to the
operator subsystem.

8. A voice service system comprising an interactive
voice response unit for interactively dealing with a
call from a human caller, an operator subsystem by
which a human operator can verbally interact with
the caller, and transfer means for transferring
handling of the call between the voice response unit and
the operator subsystem; the voice service system
having masking means for causing the operator's
verbal interaction with the caller to be done through
a synthesised voice whereby to mask from the
caller that they are now talking to a human operator,
the transfer means being usable by the operator to
have handling of a call transferred to the voice
response unit.

9. A method of providing voice services in respect of
a call placed by a human caller, the method
comprising the steps of:

(a) carrying out a verbal interaction between
the caller and a human operator;

(b) at the instigation of the operator, transferring
the call to an interactive voice response unit;
and

(c) continuing verbal interaction with the caller
through the voice response unit;

the operator's verbal interaction with the caller in
step (a) being done through a synthesised voice
whereby to mask from the caller that they are talking
to a human operator.



10. A method according to claim 9, wherein the
operator's verbal interaction with the caller involves
generating a text message from operator input and
passing this message through a text-to-speech
converter to output the operator input in said
synthesised voice.

11. A method according to claim 10, wherein the
text-to-speech converter is part of the voice response
unit and provides the same synthesised voice to the
caller whether the caller is interacting with the
operator or the voice response unit.

12. A method according to claim 10, wherein the
operator generates the text message using a keyboard.

13. A method according to claim 10, wherein the oper- 
ator generates the text message through a speech 
recogniser. 

14. A method according to claim 13, wherein the text
message output by the speech recogniser is
checked and, where required, edited by the
operator at an editing console prior to output to the
text-to-speech converter.

15. A method of providing voice services in respect of
a call placed by a human caller, the method
comprising the steps of:

(a) enabling voice interaction between the
caller and a voice response unit;

(b) analysing the caller's interaction with the
voice response unit to determine whether the
caller requires operator assistance;

(c) where this analysis indicates that operator
assistance is required, transferring the call to a
human operator; and

(d) carrying out a verbal interaction between the
caller and a human operator, this interaction
being done through a synthesised voice whereby
to mask from the caller that they are talking to
a human operator.

16. A method according to claim 15, wherein the
operator's verbal interaction with the caller involves
generating a text message from operator input and
passing this message through a text-to-speech
converter to output the operator input in said
synthesised voice.

17. A method according to claim 16, wherein the
text-to-speech converter is part of the voice response
unit and provides the same synthesised voice to the
caller whether the caller is interacting with the
operator or the voice response unit.

18. A method according to claim 16, wherein the
operator generates the text message using a keyboard.

19. A method according to claim 16, wherein the
operator generates the text message through a speech
recogniser.